[ComfyUI]: ComfyUI integration by fhfuih · Pull Request #1113 · vllm-project/vllm-omni

fhfuih · 2026-01-30T15:16:24Z

Signed-off-by: Huang, Zeyu 11222265+fhfuih@users.noreply.github.com

Purpose

Design a one-in-all ComfyUI Integration for vLLM-Omni.

Close #900 (discussion about the UI design can go there)

Draft progress

Regular diffusion image generation
Omni models (e.g., Qwen Omni series)
Mixed multi-stage image generation (e.g., Bagel)
Validation
Codebase type annotation and documentation
Audio generation
UI Beautification
~~Video generation~~ Pending API support ([Feature] Support Wan2.2 T2V and I2V Online Serving with OpenAI /v1/videos API #1073, [Bug]: Diffusion chat completion failed: 'numpy.ndarray' object has no attribute 'save' #793)
(Optional) Publication to ComfyUI Registry

Features I have experimented:

(This section is also added to plugin README)

The following features are tested:

Single-node workflows for
- Multimodal Comprehension (e.g., Qwen Omni, BAGEL)
- Text-to-Image Generation (e.g., Qwen-Image)
- Image-to-Image Generation (e.g., Qwen-Image-Edit)
- TTS (e.g., Qwen TTS, including VoiceDesign, VoiceClone, CustomVoice)

The following features are not currently tested. They will be tested in the future, and the READMEs will be updated accordingly

Multi-node workflow that connects multiple model services together.

Release Note

Officially support ComfyUI via a plugin at apps/ComfyUI-vLLM-Omni. Please check out the README in this folder for installation instructions.

Test Plan

No test for now. The test is difficult to add due to the following reasons:

the source code imports ComfyUI internal files. This is possible in runtime because the source code itself will be placed in ComfyUI internal directory. However, the test files would need to mock them.
Ideal test cases should run a real vllm service with mocked AsyncOmni in a subprocess. This is difficult to achieve, and may be introduced in another PR later.

For now, we rely on the existing entrypoint API tests to ensure that the API doesn't change.

The tests described above are WIP in my other branch https://github.com/fhfuih/vllm-omni/tree/comfyui-test. I will create another PR when it is ready.

Test Result

N/A

Essential Elements of an Effective PR Description Checklist

The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
The test plan, such as providing test command.
The test results, such as pasting the results comparison before and after, or e2e results
(Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model. TODO: Will add later
(Optional) Release notes update. If your change is user facing, please update the release notes draft.

Screenshots:

BEFORE SUBMITTING, PLEASE READ https://github.com/vllm-project/vllm-omni/blob/main/CONTRIBUTING.md (anything written below this line will be removed by GitHub Actions)

david6666666 · 2026-02-06T02:54:53Z

should we add follow features:

LoRA
Combinations of different outputs, such as image generation + image editing

etc...

Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: ff46ca8033

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

apps/ComfyUI-vLLM-Omni/comfyui_vllm_omni/nodes.py

Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>

david6666666 · 2026-02-06T09:19:10Z

@wtomin @SamitHuang @ZJY0516 ptal thx

Copilot

Pull request overview

This PR introduces a comprehensive ComfyUI integration for vLLM-Omni, enabling visual workflow-based inference for multimodal AI models through ComfyUI's node system. The integration provides nodes for image generation, multimodal comprehension, and text-to-speech tasks, supporting both single-stage and multi-stage model pipelines with configurable sampling parameters.

Changes:

Adds ComfyUI custom nodes for vLLM-Omni online serving API
Implements API client with support for image generation, editing, comprehension, and TTS
Provides sampling parameter nodes for autoregression and diffusion stages
Includes documentation, example workflows, and CI/CD workflows for publishing to ComfyUI registry

Reviewed changes

Copilot reviewed 30 out of 36 changed files in this pull request and generated 12 comments.

Show a summary per file

File	Description
`apps/ComfyUI-vLLM-Omni/__init__.py`	Plugin entry point defining node mappings and display names
`apps/ComfyUI-vLLM-Omni/vllm_omni/nodes.py`	Core node implementations for generation and sampling parameters
`apps/ComfyUI-vLLM-Omni/vllm_omni/utils/api_client.py`	Async HTTP client for vLLM-Omni API endpoints
`apps/ComfyUI-vLLM-Omni/vllm_omni/utils/format.py`	Format conversion utilities for images, video, and audio
`apps/ComfyUI-vLLM-Omni/vllm_omni/utils/validators.py`	Validation logic for model specs and sampling parameters
`apps/ComfyUI-vLLM-Omni/vllm_omni/utils/models.py`	Model pipeline specifications and payload preprocessors
`apps/ComfyUI-vLLM-Omni/vllm_omni/utils/logger.py`	Logging configuration with base64 redaction
`apps/ComfyUI-vLLM-Omni/vllm_omni/utils/types.py`	Type definitions for audio formats and model specifications
`apps/ComfyUI-vLLM-Omni/web/main.js`	Frontend extension (mostly commented out)
`apps/ComfyUI-vLLM-Omni/web/utils.js`	Multiline text widget utilities
`apps/ComfyUI-vLLM-Omni/pyproject.toml`	Package configuration and metadata
`apps/ComfyUI-vLLM-Omni/README.md`	User-facing documentation and quickstart guide
`apps/ComfyUI-vLLM-Omni/LICENSE`	Apache 2.0 license
`tests/comfyui/test_example.py`	Basic smoke test for node instantiation
`tests/comfyui/conftest.py`	Test configuration for path setup
`.github/workflows/comfyui-validate.yml`	CI workflow for backward compatibility validation
`.github/workflows/comfyui-publish.yml`	CI workflow for publishing to ComfyUI registry
`.github/workflows/build_wheel.yml`	Updated to exclude apps directory from build triggers
`docs/features/comfyui.md`	Feature documentation for the integration
`docs/.nav.yml`	Added ComfyUI to documentation navigation
`.gitignore`	Allows example workflow JSON files

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

apps/ComfyUI-vLLM-Omni/README.md

apps/ComfyUI-vLLM-Omni/web/utils.js

docs/features/comfyui.md

apps/ComfyUI-vLLM-Omni/README.md

apps/ComfyUI-vLLM-Omni/web/utils.js

tests/comfyui/test_example.py

apps/ComfyUI-vLLM-Omni/README.md

apps/ComfyUI-vLLM-Omni/web/utils.js

apps/ComfyUI-vLLM-Omni/web/main.js

Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>

fhfuih · 2026-02-06T09:56:39Z

Combinations of different outputs, such as image generation + image editing

@david6666666 What do you mean by "Combinations of different outputs"? In the current design, the "Generate Image" node can handle both image generation and image editing. Depending on whether there is an input image, it routes to the correct API endpoint with correct payload. Is this what you mean?

What I'm looking for is that the generated image can be connected to another API, v1/images/edit, for image editing, similar to a workflow.

Ah yes, we have had this discussion today, and now the readme and this PR have added a notice that connecting multiple model services are not tested. I can help test in the future and add relevant documentation and example workflows.

LoRA

And LoRA as well!

Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>

david6666666 · 2026-02-06T11:22:51Z

LGTM, look forward to follow pr

apps/ComfyUI-vLLM-Omni/AGENTS.MD

hsliuustc0106 · 2026-02-06T13:43:14Z

in the follow-up PR, please test Hunyuan Image 3.0 instruct model, we are going to use this model for demonstration and blogpost

Copilot

Pull request overview

Copilot reviewed 22 out of 27 changed files in this pull request and generated 18 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

apps/ComfyUI-vLLM-Omni/comfyui_vllm_omni/nodes.py

apps/ComfyUI-vLLM-Omni/web/main.js

apps/ComfyUI-vLLM-Omni/comfyui_vllm_omni/utils/validators.py

apps/ComfyUI-vLLM-Omni/comfyui_vllm_omni/utils/models.py

apps/ComfyUI-vLLM-Omni/comfyui_vllm_omni/utils/api_client.py

Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>

fhfuih changed the title ~~[ComfyUI]: ComfyUI integration for image generation~~ [ComfyUI]: ComfyUI integration Jan 30, 2026

fhfuih force-pushed the comfyui branch 5 times, most recently from 1890455 to 894dff2 Compare February 2, 2026 03:03

wtomin mentioned this pull request Feb 2, 2026

[RFC]: Diffusion Models Features Supports Plan #814

Open

53 tasks

fhfuih force-pushed the comfyui branch from 8e422c5 to 9cf7a4b Compare February 2, 2026 10:17

david6666666 self-requested a review February 4, 2026 02:34

fhfuih mentioned this pull request Feb 4, 2026

[Feature]: ComfyUI integration JiusiServe/vllm-omni#94

Open

1 task

This was referenced Feb 4, 2026

[RFC]: Core Diffusion Features Support Plan JiusiServe/vllm-omni#103

Closed

[RFC]: ComfyUI web serving integration JiusiServe/vllm-omni#106

Open

fhfuih added 15 commits February 6, 2026 03:34

[ComfyUI]: ComfyUI integration for image generation

296a870

Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>

Support Qwen Omni series

34e1697

Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>

[ComfyUI]: fix qwen omni nodes and validation logic

3915eba

Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>

[ComfyUI]: support BAGEL image gen and fix Qwen Image

92b847f

Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>

[ComfyUI]: support BAGEL text gen

09696bb

Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>

[ComfyUI]: doc and format

4dbc40f

Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>

[ComfyUI] README

eac5450

Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>

[ComfyUI] Add Qwen TTS

8fdac9d

Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>

[bugfix] ComfyUI TTS for Voice clone

1646377

Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>

[ComfyUI] readme

a454292

Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>

[ComfyUI] remove redundant linting for pre-commit

dd4013b

Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>

[comfyui] update doc

4db7195

Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>

clean up code

a6abdb5

Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>

[ComfyUI] add doc

7218522

Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>

[ComfyUI] clean up logging

c5393b8

Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>

fhfuih force-pushed the comfyui branch from 480dfd6 to 0eddf52 Compare February 6, 2026 03:36

fix previous commit

ea6751f

Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>

Copilot AI review requested due to automatic review settings February 6, 2026 09:14

Copilot started reviewing on behalf of fhfuih February 6, 2026 09:14 View session

chatgpt-codex-connector bot reviewed Feb 6, 2026

View reviewed changes

apps/ComfyUI-vLLM-Omni/comfyui_vllm_omni/nodes.py Show resolved Hide resolved

fhfuih added 5 commits February 6, 2026 17:17

bugfix: parameter name mismatch

7b68896

Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>

doc: add notice about test range

d54ebba

Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>

doc: add example workflows

6742b9f

Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>

chore: remove unused auxiliary files

2ad4e06

Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>

doc: add images in README

ad73416

Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>

Gaohan123 added the ready label to trigger buildkite CI label Feb 6, 2026

Copilot AI reviewed Feb 6, 2026

View reviewed changes

fhfuih force-pushed the comfyui branch from b03ecaf to ed02d3e Compare February 6, 2026 09:25

tests: rename comfyui plugin root folder name

74e2705

Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>

fhfuih force-pushed the comfyui branch from ed02d3e to 74e2705 Compare February 6, 2026 09:31

fhfuih added 2 commits February 6, 2026 17:44

fix: AI code review

7086b96

Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>

fix: ci ruff

3efa9ad

Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>

fhfuih force-pushed the comfyui branch from e5d1555 to 3efa9ad Compare February 6, 2026 09:44

fhfuih added 2 commits February 6, 2026 17:50

fix: ruff ci

ac03b63

Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>

fix: ruff ci

4a1a32b

Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>

fix: MKDOC list rendering issue

693e06f

Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>

david6666666 approved these changes Feb 6, 2026

View reviewed changes

hsliuustc0106 reviewed Feb 6, 2026

View reviewed changes

apps/ComfyUI-vLLM-Omni/AGENTS.MD Outdated Show resolved Hide resolved

hsliuustc0106 requested a review from Copilot February 6, 2026 13:42

Copilot started reviewing on behalf of hsliuustc0106 February 6, 2026 13:42 View session

Copilot AI reviewed Feb 6, 2026

View reviewed changes

fhfuih added 2 commits February 6, 2026 23:22

chore: remove AI instructions and gitignore them

873f824

Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>

fix: more AI reviews

b78e877

Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>

Conversation

fhfuih commented Jan 30, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Purpose

Draft progress

Features I have experimented:

Release Note

Test Plan

Test Result

Uh oh!

david6666666 commented Feb 6, 2026

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

Uh oh!

david6666666 commented Feb 6, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

fhfuih commented Feb 6, 2026

Uh oh!

david6666666 commented Feb 6, 2026

Uh oh!

Uh oh!

hsliuustc0106 commented Feb 6, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

fhfuih commented Jan 30, 2026 •

edited

Loading